The Automatic Translation of Film Subtitles. A Machine Translation Success Story?

نویسنده

  • Martin Volk
چکیده

Every so often one hears the complaint that 50 years of research in Machine Translation (MT) has not resulted in much progress, and that current MT systems are still unsatisfactory. A closer look reveals that web-based general-purpose MT systems are used by thousands of users every day. And, on the other hand, special-purpose MT systems have been in long-standing use and work successfully in particular domains or for specific companies. This paper1 investigates whether the automatic translation of film subtitles can be considered a machine translation success story. We describe various projects on MT of film subtitles and contrast them to our own project in this area. We argue that the text genre “film subtitles” is well suited for MT, in particular for Statistical MT. But before we look at the translation of film subtitles let us retrace some other MT success stories. Hutchins (1999) lists a number of successful MT systems. Amongst them is Météo, a system for translating Canadian weather reports between English and French which is probably the most quoted MT system in practical use. References to Météo usually remind us that this is a “highly constrained sublanguage system”. On the other hand there are general purpose but customer-specific MT systems like the English to Spanish MT system at the Pan American Health Organization or the PaTrans system which Hutchins (1999) calls “... possibly the best known success story for custom-built MT”. PaTrans was developed for LingTech A/S to translate English patents into Danish. Earlier (Whitelock and Kilby (1995), p.198) had called the METAL system “a success story in the development of MT”. METAL is mentioned as “successfully used at a number of European companies” (by that time this meant a few dozen installations in industry, trade or banking). During the same time the European Union has been successfully using a customized version of Systran for its translation service but also later for online access by all its employees. Broad coverage systems like METAL and Systran have always resulted in a translation quality that required post-editing before publications.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine Translation of Film Subtitles from English to Spanish Combining a Statistical System with Rule - based Grammar

In this project we combined a statistical machine translation system for the translation of film subtitles from English to Spanish with rule-based grammar checking. At first we trained the best possible statistical machine translation system with the available training data. The largest part of the training corpus consists of freely available amateur subtitles. A smaller part are professionally...

متن کامل

Strategies Used in the Translation of Interlingual Subtitling

This study was an attempt to identify the interlingual strategies employed to translate English subtitles into Persian and to determine their frequency, as well. Contrary to many countries, subtitling is a new field in Iran. The study, a corpus-based, comparative, descriptive, non-judgmental analysis of an English-Persian parallel corpus, comprised English audio scripts of five movies of differ...

متن کامل

Using Linguistic Annotations in Statistical Machine Translation of Film Subtitles

Statistical Machine Translation (SMT) has been successfully employed to support translation of film subtitles. We explore the integration of Constraint Grammar corpus annotations into a Swedish–Danish subtitle SMT system in the framework of factored SMT. While the usefulness of the annotations is limited with large amounts of parallel data, we show that linguistic annotations can increase the g...

متن کامل

The Correlation of Machine Translation Evaluation Metrics with Human Judgement on Persian Language

Machine Translation Evaluation Metrics (MTEMs) are the central core of Machine Translation (MT) engines as they are developed based on frequent evaluation. Although MTEMs are widespread today, their validity and quality for many languages is still under question. The aim of this research study was to examine the validity and assess the quality of MTEMs from Lexical Similarity set on machine tra...

متن کامل

Lexical Translation in Movies: A Comparative Analysis of Persian Dubs and Subtitles through CDA

The impact of translated movies has already been emphasized by quiet a number of researchers. The present study aimed to investigate the strategies of the Iranian subtitlers and dubbers of English movies in rendering English words. To this aim, three theoretical frameworks were employed: The researchers first used the strategies proposed by Venuti (2012) and Newmark's (1991) classification and ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JLCL

دوره 24  شماره 

صفحات  -

تاریخ انتشار 2009